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0.  These  are  the  Proceedings  of  the  Twelfth  Symposium  in 
Applied  Mathematics,  held  in  New  York  City,  April  14-15,  I960. 

The  papers  can  roughly  be  grouped  into  the  following  five  categories: 

(1)  Linguistics:  Y.  R.  Chao,  Graphic  and  Phonetic  Aspects  of  Lin¬ 
guistic  and  Mathematical  Symbols  (69-82),  Gordon  E.  Peterson  and 
Frank  Har ary,  Foundations  in  Phonemic  Theory  (139-65),  H.  A. 
Gleason,  Jr.  ,  Genetic  Relationship  Among  Languages  (179-89); 

(2)  Mathematical  and  Logical  Method:  W.  V.  Quine,  Logic  as  a 
Source  of  Syntactical  Insights  (1-5),  Nelson  Goodman,  Graphs  for 
Linguistics  (51-55),  Haskell  B.  Curry,  Some  Logical  Aspects  of 
Grammatical  Structure  (56-68),  Robert  Abernathy,  The  Problem  of 
Linguistic  Equivalence  (95-8),  Anthony  G.  Oettinger,  Automatic 
Syntactic  Analysis  and  the  Pushdown  Store  (104-29);  (3)  Pattern 
Recognition:  Murray  Eden,  On  the  Formalization  of  Handwriting; 

(4)  Models  of  Linguistic  Structure:  Noam  Chomsky,  On  the  Notion 


Work  on  this  review  was  done  under  the  sponsorship  of  the  Air 
Force  Office  of  Scientific  Research  of  the  Office  of  Aerospace 
Research,  under  Contract  No.  AF  47(638)-l  128. 
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"Rule  of  Grammar"  (6-24),  Hilary  Putnam,  Some  Issues  in  the 
Theory  of  Grammar  (25-42),  Henry  Hi4,  Congrammaticality, 
Batteries  of  Transformations  and  Grammatical  Categories  (43-50), 
Morris  Halle,  On  the  Role  of  Simplicity  in  Linguistic  Description 
(89-94),  Hans  G.  Herzberger,  The  Joints  of  English  (99-103), 

Victor  H.  Yngve,  The  Depth  Hypothesis  (130-8),  Joachim  Lambek, 

On  the  Calculus  of  Syntactic  Types  (166-78);  (5)  Probability  and 
Information  Theory:  Benoit  Mandelbrot,  On  the  Theory  of  Word 
Frequencies  and  on  Related  Markovian  Models  of  Discourse  (190- 
219),  Charles  F.  Hockett,  Grammar  for  the  Hearer  (220-36), 

Rulon  Wells,  A  Measure  of  Subjective  Information  (237-44),  Roman 
Jakobson,  Linguistics  and  Communications  Theory  (245-52).  A 
summary  of  the  discussion  appears  at  the  end  of  the  volume  (253-68). 

The  papers  dealing  with  models  of  linguistic  structure  rep¬ 
resent  the  central  theme  of  the  volume;  also,  their  subject  matter 
seems  to  have  captured  the  imagination  of  linguists  more  than  the 
other  topics.  This  is  the  portion  of  the  book,  therefore,  which  I 
shall  emphasize  in  this  review.  I  shall  take  an  outside  view  of  the 
approach  represented  in  it,  by  discussing  it  from  the  viewpoint  of 
anthropological  linguistics,  rather  than  in  terms  of  its  own  premises. 
In  this  connection,  I  should  like  to  characterize  the  anthropological 
linguistic  approach  by  two  aims:  as  an  immediate  aim,  the  data- 
centered  description  of  a  great  variety  of  languages,  including  many 
non-literate  languages;  as  an  ultimate  aim,  the  classification  of  the 
relation  of  language  to  culture,  both  in  a  general  theoretical  sense, 
and  in  the  particular  historical  sense  of  relating  linguistic  history 
to  ethno-history. 

The  various  models  of  structure  are  certainly  not  identical 
with  each  other,  and  their  proponents  are  often  in  open  disagreement 
(cf,  R.  B.  Lees'  Critique  of  Yngve  in  his  comments  to  Hockett's 
paper,  266).  Nevertheless,  an  outsider  is  able  to  spot  certain  re¬ 
current  conceptual  motifs  that  can  serve  to  characterize  the  "model¬ 
ing"  approach  as  a  whole.  These  concepts  will  be  presented  in  the 
form  of  a  brief  terminological  glossary,  organized  in  the  following 
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five  conceptual  categories  which  are  here  presented  in  what  I  con¬ 
sider  a  decreasing  order  of  generality: 

1.  models 

2.  the  nature  of  grammar 

3.  types  of  grammars 

4.  the  elements  of  language 

5.  rules. 

Finally,  the  question  of  simplicity  —  the  most  significant  value 
criterion  of  the  modeling  approach — will  be  commented  on. 


1.  Models.  Y.  R.  Chao  in  a  recent  paper  cites  30  different 

2 

"Synonyms  and  Characterizations  of  'Model',"  showing  the  wide 
diversity  of  opinion  and  usage  in  regard  to  this  term. 

3 

My  own  view  is  summed  up  in  a  previous  paper:  "A  Model 

can  be  a  frame  for  the  presentation  of  results  or  a  frame  for  the 
acquisition  of  knowledge.  "  The  authors  of  the  papers  under  review 
are  primarily  interested  in  the  former,  and  the  manner  in  which  the 
results  are  obtained  is  often  considered  irrelevant.  Most  anthro¬ 
pological  linguists  would  be  more  interested  in  models  in  the  sense 
of  a  frame  for  the  acquisition  of  knowledge.  In  both  senses,  a  model 
roughly  corresponds  to  the  earlier  "theoretical  framework"  or  "con¬ 
ceptual  framework.  " 


Y.  R.  Chao,  Models  in  Linguistics  and  Models  in  General, 

Logic,  Methodology  and  Philosophy  of  Science,  Pr peelings  of  the 
19o0  International  Congress,  edited  by  Ernest  Nagel,  Patrick  Suppes 
and  Alfred  Tarski,  pp.  563-4  (Stanford,  Calif.  ,  1962). 

3 

Paul  L.  Garvin,  From  Model  to  Procedure,  Proceedings  of  the 
National  Symposium  on  Machine  Translation,  held  at  the  University 
of  California,  Los  Angeles,  February  2-5,  I960,  edited  by  H.  P. 
Edmundson,  p.  367  (Englewood  Clif/s,  N.  J.  ,  1961). 
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Models  can  be  quantitative  or  qualitative.  The  ambition  of  the 

authors  under  review  is  to  produce  quantitative  models  of  language; 

the  technique  of  anthropological  linguistics  can,  paraphrasing 
4 

A.  L.  Kroeber,  be  characterized  as  that  of  "qualitative  contrast.  " 


2.  The  Nature  of  Grammar.  Chomsky's  original  definition 
that  a  grammar  is  "a  device  that  generates  all  of  the  grammatical 
sequences  of  ^a  languagej  and  none  of  the  ungrammatical  ones"^  is 
acceptable  to  most  model-oriented  authors  in  the  book.  The  following 
concepts  are  relevant  to  it. 

2.  1  Device  (12,  130).  This  need  not  be  a  device  in  the  literal 
sense  but  is  primarily  an  "abstract  logical"  device,  which  is  to  say, 
"a  set  of  rules"  (6).  It  is  closely  related  to  an 

Automaton  (7,  13-4)  which  is  an  automatic  device,  both  in 
the  above  abstract  logical  sense,  and  in  the  literal  sense  of  an  auto¬ 
matic  machine  or  at  least  a  computer  program.  Some  language 
modelers  are  interested  in  the  design  of  sentence-generating  auto¬ 
mata  in  the  literal  sense  of  computer  programs;^  others  are  satis¬ 
fied  with  the  verbal  description  of  abstract  logical  devices. 


■z - 

A.  L.  Kroeber,  Ethnographic  Interpretations,  UCPAAE  47.198  (1957): 
".  .  .  it  has  mainly  operated  with  qualitative  precision  of  definition  of 
pattern,  sharpened  in  recent  decades  by  successful  use  of  the  principle 
of  contrast.  " 
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Noam  Chomsky,  Syntactic  Structures,  p.  13  ('s  Gravenhage,  1957). 

^Victor  H.  Yngve,  Random  Generation  of  English  Sentences, 
Proceedings  of  the  First  International  Conference  on  Machine  Trans¬ 
lation  of  Languages  and  Applied  Language  Analysis  (H.  M.  S.  O.  ,  London, 
in  press). 
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A  Turing  Machine  (39  ff. )  is  an  automaton  with  no  re¬ 
strictions  on  its  storage  or  logical  capacity,  that  is,  with  an  infinite 
memory  and  circuitry.  Such  a  device  is  by  definition  an  engineer¬ 
ing  impossibility  and  therefore  has  to  remain  abstract.  It  is  of 
great  theoretical  interest  to  mathematicians.  Most  modelers  of 
language  would  like  their  grammar  device  not  to  be  a  Turing  ma¬ 
chine,  since  a  Turing  machine  could  simply  contain  in  its  infinite 
memory  all  conceivable  sequences  of  a  language  and  generate  them 
from  this  store.  This  type  of  generation  would  not  require  rules, 
and  would  hence  be  uninteresting. 

To  most  anthropological  linguists,  the  grammar  of  a  language 
is  not  a  device  but  a  part  of  the  culture  and/or  its  description. 


2.2  Generate,  generation  (pp.  17ff. ).  The  terms  produce, 
production  (pp.  131  ff . )  can  be  used  as  synonyms  if  the  grammar 
device  is  considered  in  the  literal,  not  merely  the  abstract  sense. 

The  term  "generation"  is  not  necessarily  understood  in  a 
literal  sense  (though  it  may  be,  if  the  device  is  considered  in  the 
literal  sense).  Rather,  it  is  used  to  designate  the  capacity  of  a  set 
of  rules  to  characterize  (42)  or  to  give  a  complete  specification  of 
(6)  the  sequences  to  which  these  rules  apply.  A  grammar  which  con¬ 
sists  of  such  a  set  of  rules  is  then  considered  a  generative  grammar, 
showing  the  generative  paths  (43)  by  which  these  sequences  are  said 
to  have  come  about. 

The  comparable  concept  in  anthropological  linguistics  is  for 
a  description  to  account  for  the  data,  both  those  actually  at  hand  and, 
ideally,  those  yet  to  be  collected.  An  ideal  description  in  anthro¬ 
pological  linguistics  will  give  a  complete  account  of  the  system  of 
a  language,  just  as  an  ideal  generative  grammar  will  give  a  com¬ 
plete  specification. 

The  origin  of  the  term  "generate"  is  in  mathematical  parlance, 
where  a  particular  definition  is  said  to  generate  a  set  of  entities. 
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But  the  linguistic  uses  of  the  term  lack  the  precision  of  its  use  in 
mathematics. 

2.  3  Grammatical  Sentences  (40  ff,  ,  167  ff.  ).  This  and  not 
deviant  utterances  (25  ff. )  is  what  the  sequences  generated  by  the 
device  must  be,  in  order  for  the  device  to  be  a  grammar,  i.  e.  ,  an 
adequate  model  of  the  language.  Lambek  has  "been  unable  to  find 
agreement  among  modern  linguists  as  to  what  constitutes  a  gram¬ 
matical  sentence"  (167).  To  the  anthropological  linguist,  gram¬ 
matical  sentences  are  the  equivalent  of  culturally  acceptable  speech 
behavior. 

2.4  "All  and  only"  (7,  27)  the  grammatical  sentences  of  a 
language.  This  is  what  a  grammar  device  must  be  capable  of  account¬ 
ing  for.  It  is  not  sufficient  to  describe  the  forms  contained  in  an 
actually  collected  corpus.  The  grammar  device  must  completely 
specify  all  grammatical  sentences,  and  hence  at  the  same  time  re¬ 
ject  all  ungrammatical  sentences.  The  difficulty  of  achieving  this 
aim  is  discussed  by  Putnam  (26  ff. );  it  is  related  to  the  problem  of 
defining  grammaticalness  referred  to  further  above. 

Model -oriented  theorists  of  language  often  misunderstand  the 
use  of  a  corpus  in  anthropological  linguistics.  The  aim  of  the  anthro¬ 
pological  linguist  is  to  infer  from  the  corpus  obtained  in  the  course  of 
field  work  the  kind  of  description  which  will  not  be  invalidated  by 
additional  data.  This  is  achieved  by  basing  a  first  approximation  of 
a  description  on  a  given  corpus,  and  then  cross-checking  it  against 
additional  field  data.  When  a  point  of  diminishing  returns  is  reached 
in  this  process  of  cross-checking,  it  is  assumed  that  no  discrepancies 
can  reasonably  be  expected,  and  hence  that  the  description  is  valid 
and  applies  to  the  language  in  general.  This  is  a  less  ambitious  as¬ 
sertion  than  the  one  about  "all  and  only,  "  but  it  says  very  much  the 
same,  albeit  more  realistically. 
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3.  Types  of  Grammars.  The  major  dispute  here  seems  to 

be  whether  or  not  a  finite-state  device  (137)  can  be  an  adequate  model 

of  language.  In  non-model  terminology,  the  question  is  whether  or 

not  speech  behavior  can  be  adequately  described  as  a  probabilistic 

succession  of  finite  states.  In  linguistics,  the  debate  seems  to  have 

been  triggered  by  Hockett's  description  of  the  nature  of  speech  com- 

7 

munication  in  terms  of  certain  aspects  of  probability  theory. 

No  anthropological  linguist  will  deny  that  the  probabilistic  as¬ 
pect  of  speech  behavior  is  significant,  although  probabilistic  calcula¬ 
tions  have  so  far  been  applied  in  historical  linguistics  (lexicostatistics), 
and  not  yet  in  grammatical  description.  The  prerequisite  for  the 
application  of  probability  theory  is  a  knowledge  of  what  the  elements 
are  that  are  to  be  considered  in  the  calculation.  Viewed  in  this  light, 
matters  of  probability  —  including  questions  of  finite  state  models  — 
presuppose  a  grammar  rather  than  constitute  one. 

4.  The  Elements  of  Language.  Here  is  where  the  anthro¬ 
pological  linguist  finds  himself  on  least  familiar  ground.  Not  only 
are  a  number  of  familiar  terms  either  missing  or  given  less  prom¬ 
inence,  but  the  key  terms  in  the  model -oriented  discussion  of  language 
tend  to  be  unfamiliar. 

4.  1  Set,  as  in  set  of  strings  (8)  or  set  of  sentences  (6,  99). 

The  term  comes  from  mathematics:  a  set,  in  the  mathematical 
sense,  is  "any  collection  of  objects  ....  A  set  is  defined  by  speci¬ 
fying  which  objects  are  elements  of  the  set.  This  may  be  done  by 
stating  some  rule  by  which  the  elements  can  be  identified,  or  by 

g 

actually  putting  the  elements  on  display.  "  (i.  e.  ,  enumerating  them). 

_ 

C.  F.  Hockett,  A  Manual  of  Phonology,  pp.  6-11  (Baltimore,  1955). 

g 

Irving  Allen,  The  New  Mathematics,  p.  36  (A  Mentor  Book, 

New  York,  196TJ]’! 
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As  used  by  the  modelers,  the  set  of  strings  or  sentences  is 
defined  neither  by  an  identifying  rule  nor  by  an  enumeration  of  the 
elements.  The  elements  are  identified  simply  by  giving  a  verbal 
label. 


4.2  Sentences  (passim).  These  are  primary  givens  in  model¬ 
ing  terminology.  Anthropological  linguists  might  want  to  know  how 
sentences  are  defined,  or  how  they  differ  from  utterances.  This 
question  is  hinted  at  by  Putnam  when  he  cites  Ziff's  term  "deviant 
utterance"  (25)  and  asks  why  it  is  not  called  "sentence.  " 

4.  3  Strings  (8  ff.  ,  100  ff. )  as  in  strings  of  symbols  (8). 

These  are  simply  sequences.  The  term  "string"  is  used  in  mathe¬ 
matics  as  an  informal  notion  denoting  a  sequence  of  symbols  of  any 
sort. 

As  used  in  the  volume  under  review,  relations  are  not  included 
as  part  of  these  strings;  they  are  expressed  by  the  rules  which  are 
applied  to  the  strings  (see  5.  below). 

4.4  Symbols  (passim).  In  modeling  terminology,  the  smallest 
elements  of  a  language  are  symbols.  By  symbols  are  meant,  not 
symbols  in  the  behavioral  sense,  but  graphic  symbols  as  the  logician 
manipulates  them.  Language  is  said  to  consist  of  strings  (or  se¬ 
quences,  42  ff.  )  of  such  symbols.  Anthropological  linguists  would 
like  to  see  greater  emphasis  placed  on  what  the  symbols  stand  for 
and  how  they  were  arrived  at.  Some  of  them  would  also  prefer  to  re¬ 
strict  the  use  of  symbols  to  phonemics  and  a  few  clearcut  cases  else¬ 
where  in  the  grammar  (such  as  some  morphophonemic  variations  or 
simple  relations  in  syntax),  and  to  be  content  with  verbal  statements 
for  the  most  part, 

4.  5  Trees  (8  ff.  ,  100  ff.  ,  131  ff. )  provide  a  graphic  repre¬ 
sentation  of  the  immediate  -  constituent  analysis  of  strings.  Trees 
have  nodes  (11,  51,  132)  at  the  points  where  their  branches  (11) 
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separate,  as  well  as  at  the  ends  of  the  branches.  In  a  properly 
drawn  tree,  there  is  a  symbol  at  each  node. 

Trees  are  in  essence  modern  versions  of  the  old  parsing 
diagrams;  their  design  depends  on  the  interpretation  of  the  con¬ 
stituent  structure  which  they  represent.  The  modelers  are  not 
interested  in  how  this  structure  was  discovered.  Trees  are  con¬ 
sidered  a  more  efficient  means  of  representation  than  the  older 

9 

diagrams  such  as  the  one  shown  in  Hockett's  text  book,  and  they 
are  easier  to  draw, 

4.  6  Nesting  (111  ff . )  or  embedding  (10  ff . )  is  the  character  - 
istic  of  a  given  construction  of  being  enclosed  within  another.  This 
is  one  of  the  properties  of  natural  human  language  that  makes  its 
modeling  difficult.  Related  to  it  is  the  question  of  depth  (130  ff. ), 
where  the  problem  of  the  permitted  number  of  nestings  is  con¬ 
sidered.  This  appears  to  be  a  subject  of  some  controversy;  Yngve 
(ibid.)  accepts  George  Miller's  statement^^  that  the  maximum  num¬ 
ber  of  nestings,  because  of  limitations  on  human  memory,  is  seven. 
Others  hold  that  nesting  is  potentially  infinite. 

Anthropological  linguists  would  here  ask  a  somewhat  different 
question.  Rather  than  assuming  nesting  as  a  universal,  they  would 
ask  whether  all  languages  allow  nesting,  and  of  those  that  do,  to 
what  extent  they  allow  it. 

5.  Rules  (6  ff.  ,  89  ff. ).  These  are  the  modeling  equivalent 
of  the  anthropological  linguist's  statements.  They  differ  from  state¬ 
ments  by  being  required  to  have  a  narrowly  circumscribed  form 
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C.  F,  Hockett,  A  Course  in  Modern  Linguistics,  pp.  150  ff. 

(New  York,  1958)"!  Note,  however,  that  in  the  volume  under  re¬ 
view  he  also  presents  a  tree  diagram  (Z32). 

^^George  A.  Miller,  Human  Memory  and  the  Storage  of  Informa¬ 
tion,  IRE  Transactions  on  Information  Theory  IT-2.  129-37  (1956). 
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(where  statements  can  be  more  varied).  An  instance  of  this  are  the 
rewrite  rules  (8),  of  the  form  "rewrite  X  as  Y,  "  for  generating 
constituent  structures.  They  consist  in  rewriting  symbols.  (The 
statement  equivalent  would  be:  units  are  substituted  for  each  other.  ) 
The  restriction  consists  in  requiring,  for  instance,  that  only  one 
symbol  be  rewritten  at  a  time.  No  such  restriction  applies  to  state¬ 
ments,  and  anthropological  linguists  would  consider  it  arbitrary. 

The  reason  for  this  type  of  restriction  is  the  emphasis  on  "describ¬ 
ing  the  form  of  grammars,  which  implies  the  requirement  of  a 
very  particular  form.  It  is  possible,  of  course,  to  envision  any 
other  particular  form — the  choice  is  then  to  be  made  on  grounds  of 
simplicity  (see  6.  below). 

Recursiveness  (22,  29)  is  considered  an  essential  character¬ 
istic  of  rules,  if  they  are  to  model  language  adequately.  In  the 
mathematics  of  discrete  quantities,  a  recursive  rule  is  one  in  which 
a  quantity  is  not  defined  explicitly,  but  implicitly  in  terms  of  its  pre¬ 
decessors.  In  the  volume  under  review,  the  term  appears  to  be 
used  in  the  less  specific  sense  of  iteration — it  seems  to  character¬ 
ize  rules  that  allow  consecutive  re-application.  There  is  no  corre¬ 
sponding  requirement  in  anthropological  linguistics. 

The  order  of  application  of  rules  (91  ff.  ,  131  ff.  )  is  essential 
in  a  generative  model  since  it  is  one  of  the  means  of  achieving  sim¬ 
plicity  (see  below).  The  equivalent  here  is  the  order  of  statements 
in  a  description.  This  is  just  as  significant  to  the  anthropological 
linguist,  but  the  proper  order  of  statements  is  considered  more  a 
matter  of  good  descriptive  procedure  in  general  than  of  simplicity  in 
particular. 


^^Chomsky,  op.  cit.  in  fn.  5,  p.  56. 

1 2 

H.  P.  Edmundson,  personal  communication. 
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6.  Simplicity  (38,  89  ff. ).  Simplicity  is  the  evaluation 
criterion  used  to  judge  models  of  language,  Halle  (89  ff.  )  relates 
simplicity  to  the  order  of  application  of  the  rules;  it  is  significant 
to  note  that  he  is  able  to  discuss  the  concept  in  the  area  of  phonology 
which  most  anthropological  linguists  would  consider  comparatively 
simple  to  begin  with. 

The  criterion  of  simplicity  presupposes  that  the  question  of 
factual  correctness  has  already  been  resolved.  This  condition  is 
not  easily  achieved  in  the  case  of  little -known  languages.  Hence, 
the  question  of  simplicity  does  not  often  arise;  questions  of  con¬ 
sistency  and  particularly  of  exhaustiveness  are  much  more  likely 
to  trouble  the  field  worker  or  comparatist, 

7,  Summary,  We  may  note  the  significant  difference  in 
emphasis  and  interest  between  model -building  and  anthropological 
linguistics.  The  model-builder  assumes  that  a  reliable  description 

is  given  and  is  interested  in  an  "insightful”  re-statement.  The  anthro¬ 
pological  linguist,  on  the  other  hand,  deals  with  data  where  a  de¬ 
scription  is  either  absent  or  unreliable.  He  is  therefore  much  more 
concerned  with  the  original  statement  and  with  its  reliability. 
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